Thematic ranking of object summaries for keyword search

نویسندگان

  • Georgios John Fakas
  • Yilun Cai
  • Zhi Cai
  • Nikos Mamoulis
چکیده

An Object Summary (OS) is a tree structure of tuples that summarizes the context of a particular Data Subject (DS) tuple. The OS has been used as a model of keyword search in relational databases; where given a set of keywords, the objective is to identify the DSs tuples relevant to the keywords and their corresponding OSs. However, a query result may return a large amount of OSs, which brings in the issue of effectively and efficiently ranking them in order to present only the most important ones to the user. In this paper, we propose a model that ranks OSs containing a set of identifying keywords (e.g., Chen) according to their relevance to a set of thematic keywords (e.g. Mining). We argue that the effective thematic ranking of OSs should combine gracefully IR-style properties, authoritative ranking and affinity. Our ranking problem is modeled and solved as a top-k group-by join; we propose an algorithm that computes the join efficiently, taking advantage of appropriate count statistics and compare it with baseline approaches. An experimental evaluation on the DBLP and TPC-H databases verifies the effectiveness and efficiency of our proposal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Effective Path-aware Approach for Keyword Search over Data Graphs

Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...

متن کامل

Ranking of Resulting Objects and Snippet Generation for Falcons

Semantic web search engine Falcons support keyword based search for linked objects by using comprehensive virtual document which it creates for each object. In our work we are suggesting idea of using Selectivity Estimation of triple patterns for ranking of resulting objects and generating snippet for the keyword query for Falcons Semantic web search engine. Selectivity of a triple pattern is t...

متن کامل

Ranking of Resulting Objects and Snippet Generation

Semantic web search engine Falcons support keyword based search for linked objects by using comprehensive virtual document which it creates for each object. In our work we are suggesting idea of using Selectivity Estimation of triple patterns for ranking of resulting objects and generating snippet for the keyword query for Falcons Semantic web search engine. Selectivity of a triple pattern is t...

متن کامل

Discovering and visualizing narrative themes

This paper presents a framework for indexing and browsing databases of stories, in particular characterizing and visually exploring each narrative’s thematic content. We introduce a method for discovering thematic content in texts via lexical dissimilarity statistics. A maximumlikelihood algorithm clusters words into pools of similar meaning, using a thesaurus for rough estimates of word sense ...

متن کامل

Effective Ranking of XML Keyword Search Results (Extended Version)

The popularity of XML has exacerbated the need for an easy-to-use, high precision query interface for XML data. When traditional document-oriented keyword search techniques do not suffice, natural language interfaces and keyword search techniques that take advantage of XML structure make it very easy for ordinary users to query XML databases. Unfortunately, current approaches to processing thes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 113  شماره 

صفحات  -

تاریخ انتشار 2018